Inferring actor communities from videos
نویسندگان
چکیده
In recent years there has been a growing interest in inferring social relations amongst actors in a video using audiovisual features, co-appearance features or both. The discovered relations between actors have been used for identifying leading roles, detecting rival communities in a movie plot etc. In this paper we propose an unsupervised method which uses the video’s transcript and closed caption information for discovering actor communities (group of actors or characters in a film that share a common perspective/viewpoint on an issue) from videos. The method proposed groups together actors using a topic model based approach, which jointly models actor-actor interaction (two actors interact when they share the same scene) and the topics associated with their conversations/dialogs. This joint modeling approach shows encouraging results compared to existing methods.
منابع مشابه
Actor and Observer: Joint Modeling of First and Third-Person Videos
Several theories in cognitive neuroscience suggest that when people interact with the world, or simulate interactions, they do so from a first-person egocentric perspective, and seamlessly transfer knowledge between third-person (observer) and first-person (actor). Despite this, learning such models for human action recognition has not been achievable due to the lack of data. This paper takes a...
متن کاملA database of whole-body action videos for the study of action, emotion, and untrustworthiness
We present a database of high-definition (HD) videos for the study of traits inferred from whole-body actions. Twenty-nine actors (19 female) were filmed performing different actions-walking, picking up a box, putting down a box, jumping, sitting down, and standing and acting-while conveying different traits, including four emotions (anger, fear, happiness, sadness), untrustworthiness, and neut...
متن کاملInferring Shared Attention in Social Scene Videos
This paper addresses a new problem of inferring shared attention in third-person social scene videos. Shared attention is a phenomenon that two or more individuals simultaneously look at a common target in social scenes. Perceiving and identifying shared attention in videos plays crucial roles in social activities and social scene understanding. We propose a spatial-temporal neural network to d...
متن کاملAction Understanding with Multiple Classes of Actors
Despite the rapid progress, existing works on action understanding focus strictly on one type of action agent, which we call actor—a human adult, ignoring the diversity of actions performed by other actors. To overcome this narrow viewpoint, our paper marks the first effort in the computer vision community to jointly consider algorithmic understanding of various types of actors undergoing vario...
متن کاملLearning Social Relations from Videos: Features, Models, and Analytics
Despite the progress made during the last decade in video understanding, extracting high-level semantics in the form of relations among the actors in a video is still an under-explored area. This chapter discusses a streamlined methodology to learn interactions between actors, construct social networks, identify communities, and find the leader of each community in a video sequence from a socio...
متن کامل